Faint Source Counts from Off-source Fluctuation Analysis on 
Chandra Observations of the Hubble Deep Field-North 
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ABSTRACT 

We report the results of fluctuation analysis of the off-source field from the 1 Million second 
Chandra Observation of the Hubble Deep Field-North (HDF-N). The distribution of the counts 
in cells has been compared with the expectations from the Log N — Log S model to constrain the 
behavior of the source number density down to a factor of several lower than the source-detection 
limit. Our results show that the number counts in the soft band (0.5-2 [keV]) continue to grow 
down to S x ~ 7 x 10 -18 [ergs -1 cm -2 ], possibly suggesting the emergence of a new population 
and agree well with a prediction of star forming galaxies by Ptak et al. (2001). For the hard 
(2-10 keV) band, the fluctuation analysis can loosely constrain the source counts fainter than 
the detection limit and we found an upper limit of < 10000 [sres deg -2 ] at S x ~ 2 x 10 -16 
[erg s -1 cm -2 ]. 
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The number count of X-ray sources as a func- 
tion of flux (the so called the Log TV — Log S rela- 
tion) is one of the key constraints for models of the 
X-ray source population. Most of the "Cosmic X- 
ray Background" (CXRB) intensity has now been 
resolved with "Chandra" (Mushotzky et al. 2000; 
Tozzi et al. 2001; Brandt et al. 2001b) to the ex- 
tent that major uncertainties in the fraction of 
the CXRB which have been resolved into individ- 
ual sources, i.e, how much of it remains to be ex- 
plained, lie in the absolute intensity of the CXRB 
and field-to-field fluctuations due to cosmic vari- 
ance. In terms of the origin of the CXRB, the 
faintest sources in the Chandra Deep Fields are 
becoming less interesting. However, constraints 
on number counts at the faintest possible fluxes 
provide a new view on the nature and evolution of 
the X-ray emitting sources. 

Fluctuation analysis is a strong tool for con- 
straining the source counts below the source de- 
tection limit. This technique has been successfully 



applied to data from previous missions (Hasinger 
et al. 1993; Georgantopoulos et al. 1993; Gendreau 
et al. 1998; Yamashita et al. 2000; Perri & Giommi. 
2000) and the source counts inferred from these 
analyses have turned out to be consistent with 
those from resolved sources in deeper observations. 
The Chandra observations of the Hubble Deep 
Field North (HDF-N) and Chandra Deep Field- 
South (CDF-S), with an exposure of about 1 Ms 
each, are the deepest X-ray imaging data obtained 
so far. Fluctuation analyses on these fields are an 
essential step in pushing the limit of the source 
counts to even fainter fluxes. 

In this letter, we report our initial results of 
the fluctuation analysis of the 1 Ms of the Chan- 
dra observation of HDF-N. In this first analysis, 
we have assumed that the sky fluctuation solely 
comes from unresolved point sources (AGNs and 
galaxies). An analysis including the CDF-S, fur- 
ther HDF-N exposures, and possible effects of ex- 
tended X-ray emission will be reported in a future 
paper (paper II, in preparation). In this letter, 
we use H = 65 [kms -1 Mpc -1 ], f2 m = 0.3, and 
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ft A = 0.7 unless otherwise noted. 

2. Data Preparation 

The Chandra X-ray Observatory (CXO) was 
used to observe the HDF-N field with its ACIS- 
I detectors. The total exposure time attained as 
of Spring 2001 has been 975 [ks] over 12 obser- 
vations and the data, which have been aligned 
to an astrometric accuracy of <~ 0'.'5, have been 
fully archived 1 . We have made use of the Level 1 
merged event list from the archive to create im- 
ages for our analysis. The charge transfer ineffi- 
ciency (CTI), has been corrected according to pro- 
cedures in Townsley, Broos, Garmire, & Nousek 
(2000). After this correction, we have filtered 
events to remove hot pixels, columns and flaring 
events flagged in the archived eventlist. 

In order to reduce the particle background as 
much as possible, we have used the restricted 
grades (Brandt et al. 2001b), i.e., fltgrade=0 ,64 for 
the soft (0.5-2 kcV) band and fltgrade=0,2,8, 16,64 
for the hard (2-8 keV) band respectively. Us- 
ing these restricted grade sets, we removed 46% 
and 25% of the off-source events (non X-ray back- 
ground dominated), while removing only 15% and 
13% of source events for the soft and hard bands 
respectively compared with the standard selec- 
tion (gra<ie=0,2,3,4,6). There are two locations 
where CXO pointings are concentrated in the 
1 Ms of the HDF-N data, one near (a,S) = 
(189?2680,62?2150) and the other near {a, S) = 
(189?1476,62?2437). For the fluctuation analy- 
sis, we have chosen a region which is an inter- 
section of two 5f2-radius circles centered at these 
two locations. We have also excluded the locations 
corresponding to the gaps between CCDs by im- 
posing a total exposure value of at least 800 [ks] . 
The cell size used for making histograms of the 
detected counts was 4" x 4". which was chosen 
to match the largest point-spread function (PSF) 
in this area. The region contains neither of the 
two extended sources discussed by Brandt et al. 
(2001b). The flux (before Galactic absorption)- 
to-countrate conversion factors have been calcu- 
lated using PIMMS 2 assuming a T = 1.4 power- 
law with a Galactic absorption of Ah = 1.6 10 20 

1 http://asc.harvard.edu/udocs/ao2-cdf-download. html 
2 http://heasarc. gsfc.nasa.gov/Tools/w3pimms. html 



[cm 2 ] corrected for the loss of events for using 
the restricted grade set. 

2.1. Point Source Detection and Removal 

We have used the wavdetect utility distributed 
as a part of the Ciao 2.1 package 3 to find sources 
to be masked out from the image to be ana- 
lyzed. We have used a false detection proba- 
bility threshold of 1 10~ 6 and wavelet scales of 
1,V^,2,2V2,4,4V2,8 pixels on O'.'98-pixel im- 
ages. The detection completeness has been ex- 
tensively investigated using running the same 
source detection procedures on simulated images 
(Sect. 3. 2). The sources with wavelet-detected 
counts larger than 12(18) counts, corresponding to 
5 x i 6 = 0.7(6.) in 0,5-2 (2-10) [kcV] (here and here- 
after, S* x i6 represents an X-ray flux measured in 
10~ 16 [ergs _1 cm~ 2 ]) have been masked out from 
the field for the soft (hard) band. The simula- 
tion shows that less than one source per field fails 
to be detected above these thresholds. We take 
conservatively large radii for the source exclusion 
region, ranging from 6" to ~ 18", depending on 
the wavelet-detected source size and counts. Af- 
ter the source removal, the remaining area for 
the fluctuation analysis was 7848(8043) pixels or 
34.9(35.7) arcmin 2 for the soft(hard) band. 

Sources detected in the same procedure over a 
larger area of the sky (within 6.'3 from both of the 
pointing centers; 78 [arcmin 2 ]) are used to give 
the "resolved source" constraints on the fluctua- 
tion analysis discussed in Sect. 3.3 and to calcu- 
late the resolved source Log N — Log S shown in 
Sect. 4 

3. Off-source Fluctuation Analysis 
3.1. Overall Procedure 

Using the off-source map generated above, we 
have searched for the behavior of Log N — Log S 
below the source detection limit which is con- 
sistent with the off-source fluctuation. For the 
model, a broken power-law form for the cumula- 
tive source count has been assumed: 

m ^ «x _ J N Q (S/S max )-^ : (S h < S < 5 max ) 
[> \ N(> Sb){S/S b )-v : (5 min < S < S h ) 

3 http://asc. harvard.edu/ciao/ 



, where S m i n and S m!iX are the minimum and max- 
imum fluxes between which we would like to con- 
strain the behavior. We set S max as the limiting 
flux of the complete source detection and S m i n as 
the flux corrsponding to <~ 1 count in the image. 
Confidence level searches in the parameter space 
have been made for two cases, each with two free 
parameters: (1) Single power-law with free param- 
eters (iVo,7i), and (2) Broken power-law with free 
parameters (5b 72), where the Nq and 71 values 
fixed to the best-fit of the single power-law case. 
The second power-law component helps determine 
the practical sensitivity limit of this analysis, fol- 
lowing the approach by Hasinger et al. (1993). For 
each point in the parameter space, we have run 
Monte-Carlo simulations and compared simulated 
and observed histograms of the number of pixels 
as a function of the number of detected counts (the 
so-called P(D) diagram) as detailed in the follow- 
ing subsections. By running the simulations over 
the parameter space, we have determined the sub- 
space where the model was accepted at a 90% con- 
fidence level. For the accepted subspace, we have 
calculated the minimum and maximum N(> S) 
values at each flux between S'min and S m!iX to find 
final constraints on the Log N — Log S behavior. 
The resuling constraints are shown and discussed 
in Sect. 4 

3.2. The Image Simulation 

The image simulation has been made using sim- 
ulation software developed by one of the authors 
(TM). It has been designed to meet the needs of 
massive Monte- Carlo simulations of a summed im- 
age from multiple observations with offset point- 
ings. For each observation, it uses a separate ex- 
posure map and a different pointing position, from 
which the off-axis angle is calculated. 

An image simulation runs as follows. Firstly, 
random point sources are generated based on an 
input Log ./V — Log S model. For a given input 
source (with a sky position and a physical flux), 
the simulator generates Poisson-deviated number 
of events based on the maximum exposure value 
of each observation. The off-axis angle is calcu- 
lated from the pointing direction of the observa- 
tion and the simulator spatially spreads the gener- 
ated events based on the off-axis angle dependent 
PSF. For each of the PSF-deviated event, a ran- 
dom number between and 1 is generated. The 



event is rejected if this random number is larger 
than the ratio of the exposure value at the point 
and the maximum exposure. This two-step proce- 
dure assures the correct treatment near the CCD 
edges and bad columns. The above steps are re- 
peated for all the observations and all the gener- 
ated sources. Background events are then added 
to the image in such a way that the total count of 
the sources and the background is the same as that 
of the real data. The background events have been 
distributed assuming that they are dominated by 
the particle background and it is uniform over the 
active pixels of the CCDs and not affected by the 
vignetting of the CXO telescope. 

We have forced the total count of the simulated 
image to be equal to the real data, instead of also 
considering the Poisson deviation of the residual 
background count. This is because the underlying 
background rate is not an interesting parameter 
here and in order to assess the confidence range 
of interesting parameters (the Log N — Log S pa- 
rameters in this case) for given data, uninteresting 
parameters should be adjusted to give the best fit. 
In our case, forcing the total counts to be equal 
automatically makes this adjustment. 

3.3. Statistics 

The probability distribution of the P(D) his- 
togram is more complicated than the simple Pois- 
sonian or Gaussian, involving source and photon 
fluctuations with correlated errors. Thus using an- 
alytical formulae of the probability distributions 
of common statistical measures (e.g. the \ 2 dis- 
tribution), gives inaccurate results. Thus we use 
Monte-Carlo simulations to calculate the probabil- 
ity distributions. With Monte-Carlo simulations, 
the probability that a model is correct can be es- 
timated using any reasonable estimator which in- 
dicates the deviation between the model and the 
data. In this work, we have used a modification 
of the Cash (1979) estimator by Caster, as used in 
the XSPEC 11.1 (Arnaud & Dorman 2001), the 
change of which usually follows the x 2 probability 
distribution: 

C = 2^[AT mdli -AT dati +Ar dati (ln JV dati -ln AUiJ], 

i 

(2) 

where -/V dati and JV m di ( are the observed and 
model-predicted numbers of pixels having i counts 
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(0 < i < n) or the number of resolved sources 
(i = n + 1, observed or predicted; see below) re- 
spectively. The sum is over 2 < i < n + 1. We 
have excluded i = 0, 1 from the sum because these 
two bins are not independent from the others (the 
total numbers of pixels and events are fixed). As 
an additional constraint, a term (i = n+ 1) where 
the number of resolved sources above the defined 
flux limit is -/Vd a t,n+i and its model-predicted value 
^mdi.n+i has been included in Eq. 2. This term 
constrains the Log N — Log S at the brighter end 
of the fluctuation analysis. 

For a given model, 1000 simulations have been 
made and the mean of the simulated results has 
been taken as N^i^i as Hasinger et al. (1993) did. 
However, instead of using the y 2 distribution, we 
have used the distribution of 1000 C values be- 
tween this N m di,i an d the 1000 simulated Nd at ,i 
histograms. If > 10% of the simulated C values 
are greater than that between the model and the 
real data, the model has been accepted at a 90% 
confidence level. 

4. Results and Discussion 

The resulting constraints (90% confidence 
range) on the Log N — Log S behavior in the 
0.5-2 [keV] and 2-10 [keV] bands (the latter have 
been scaled from 2-8 [keV] assuming a F = 1.4 
power-law spectrum) obtained from our analysis 
are shown in Fig. 1 along with a number of recent 
results from the literature in the flux regime of in- 
terest. The resolved source counts (Sect. 2.1) have 
also been plotted with lcr error bars. 4 These are 
consistent with those from Brandt et al. (2001b), 
who used sources from a smaller part of the same 
general area to avoid incompleteness. Our re- 
solved source counts are from a larger area and 
are plotted down to fluxes which were not affected 
by incompleteness as determined from the simu- 
lations. The fluctuation results have been plotted 
down to a flux where the number counts have been 
reasonably constrained. We have set the faintest 
sensitivity limit at the flux where the difference 
between the upper and lower bounds is a factor 
of three. We note that we could determine the 
sensitivity limit in this way by exploring the pa- 

4 ASCII tables of the results can be found in 
the sre distribution of this preprint and at 
http: / /astrophysics. phys.cmu.edu/~dccpxray /tables 



rameter space in a broken power-law form (Eq. 
1), and not just the single power-law model. 

For verification, we have also overplotted the 
results of our previous analysis, which utilized es- 
sentially the same procedure, when only 220 [ks] 
of CXO data were available on this field (Miyaji et 
al. 2000b). As 1 Ms of data have been obtained, 
fainter sources have now been resolved. The re- 
solved source counts in the 1 Ms of observation 
agree well with the 220 ks fluctuation results. 

Fig. 1 shows that the deep resolved-source 
counts in CDF-S by Campana et al. (2001), whose 
faint limit was attained by full corrections for in- 
completeness and the Eddington bias, are some- 
what below the lower limit of our fluctuation re- 
sults in both energy bands. Comparing a num- 
ber of results from the literature (Tozzi et al. 
2001; Campana et al. 2001; Brandt et al. 2001b; 
Mushotzky et al. 2000), the X-ray source counts 
from CDF-S, as well as those of the Lockman 
Hole observed with XMM-Newton(Hasinger et al. 
2001), are in general lower than those of the HDF- 
N region by < 20 — 40%, depending on the work 
cited and flux range. This probably reflects ac- 
tual cosmic variance in the sky with some minor 
contribution from different calibrations and source 
detection techniques. 

The most significant result from our analysis 
is that the 0.5-2 [keV] Log N — Log S continues 
to grow down to S x i 6 - 0.07. The 2-10 [keV] 
Log N — Log S fluctuation results are consis- 
tent with both starting to saturate and contin- 
uing to grow below 5 x i6 < 7, while some loose 
constraints can be made (e.g. N(> S) < 10000 
at S x iq = 2). Here we focus our discussion on 
the implication of our results in the soft band. 
From the deepest ROSAT surveys (Lehmann et 
al. 2001), the X-ray sources at 5 x i6 > 20. turns 
out to be dominated by AGNs, while at fainter 
fluxes, difficulties in complete optical identifica- 
tions of the faint sources detected in CXO/XMM- 
Newton (S x iq > 1) leave us with tantalizing situa- 
tions in the nature and redshifts of these sources. 
There still exist large uncertainties in the behav- 
ior of the soft X-ray luminosity function (SXLF) 
of AGNs at moderately low luminosities (L x <~ 
10 42 _ 1Q 43 [ergs" 1 ]) at z ~ 0.5 - 2. Miyaji, 
Hasinger, & Schmidt (2000a, 2001) constructed 
an SXLF of AGNs (L x > 10 41 - 5 [crg S - 1 ]) using 
ROSAT surveys with a very high degree of com- 
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Fig. 1. — Fluctuation and resolved source Log N — Log S relations in the 0.5-2 and 2-10 keV bands are 
shown with those from a number of other works as labeled. The solid circles are from the resolved-sources 
(Sect. 2.1). The solid horns arc from the fluctuation analysis. Results from a number of recent publications 
are overplotted as labeled. For a verification, the results of our previously reported analysis for the 220 ks of 
data (Miyaji et al. 2000b) are also shown. They correctly predicted the results of the resolved source number 
counts of the much deeper 1 Ms data. 



pleteness in spectroscopic identifications down to 
S x ~ 20. They discussed two extremes of possible 
extrapolations of the SXLF behavior in the frame- 
work of the luminosity-dependent density evolu- 
tion (LDDE) picture. One of them has rapidly 
dropping density-evolution rates as luminosities 
become lower (L x < 10 44 [ergs^ 1 ]) (LDDE1) and 
the other has only a moderate drop in evolution 
rate in such a way that AGNs alone can make ~ 
90% of the soft X-ray background (LDDE2). The 
LDDE2 model overpredicts the number count by 
a factor of two at 5x16 = 1 and thus can be consid- 
ered to be rejected by the new data. We have over- 
plotted the 0.5-2 keV Log N — Log S prediction 
from the AGN population synthesis model com- 
posed of unabsorbed and absorbed AGNs based 
on the LDDE1 model by Gilli, Salvati, & Hasinger 
(2001) (model B). The plotted model have been 
slightly modified from the original as described 
in Rosati et al. (2001). The curves for the to- 
tal and absorbed (A H > 22 [cm -2 ]) AGN pop- 
ulations are plotted. Fig. 2 shows that the AGN 
counts drop below 5 x i6 = 1 even if the emergence 
of the obscured AGN population is taken into ac- 
count, while the source counts from the fluctua- 



tion analysis continue to grow. If their model B 
closely represents the true behavior of the SXLF 
of the AGN population, we are probably seeing 
the emergence of a new population of faint X-ray 
sources. This excess may be contributed by AGNs 
with low intrinsic luminosities (at L x < 10 41 ' 5 
[erg s" 1 ] ) and/or X-rays from star- formation activ- 
ities through supernova remnants and low-/high- 
mass X-ray binaries. In view of this, we have also 
overplotted a model prediction of X-ray number 
counts based on the cosmic star-formation rate 
(Ptak et al. 2001) for the two models of binary 
evolution (Gaussian and Peak-M) from Ghosh & 
White (2001). Within the uncertainties in the 
AGN source counts and model construction, these 
predictions gave approximately the number counts 
from our fluctuation analysis. This is also consis- 
tent with the results of stacking analysis of bright 
galaxies by Brandt et al. (2001a). 

Finally we describe a number of caveats to be 
considered in interpreting our analysis. The pos- 
sisble inhomogeneities in the CCD quantum ef- 
ficiencies and particle backgrounds, and the exis- 
tence of low-surface brightness diffuse sources (un- 
resolved galaxy groups and possible intergalactic 



5 



10 6 


1 I, 


i i 

0.5-2 keV 1 


10 4 






10 3 




1 Ms HDF-N Source ^\ | ~ 




: c=i 


CXO HDF-N 1 Ms Fluctuation 

— AGN total/obscured (Gilli et al. '01. modified) N - 




/••• 

in 


■ -■ Galaxies with Gaussian/Peak-M V 
binary evolutions (Ptak et al. '01) 



10 -|B 1Q-J7 [Q-18 10 -15 

S x [erg s _1 cm -2 ] 



Fig. 2. — Derived Log N — Log S relation in 
the soft band is compared with AGN and galaxy 
number count models. Predictions from the pop- 
ulation systhesis model (B) by Gilli, Salvati, & 
Hasinger (200f ) for the total AGN population and 
the obscured (AT H > 10 22 [cm -2 ]). Predicted 
number counts of galaxies using the cosmic star- 
formation history and two modes (Gaussian and 
Peak-M) of evolution of X-ray binaries (Ptak et al. 
2001) are also plotted. If mode B of Gilli, Salvati, 
& Hasinger (2001) represents the correct behav- 
ior of absorbed and unabsorbed AGNs, the fluc- 
tuation constraints suggest the emergence of an 
additional population, probably from galaxies. 

medium) would all work in the directions of in- 
creasing the apparent number counts inferred from 
the fluctuation analysis. The excellent agreement 
between the 220 ks fluctuation results and 1 Ms 
source counts shows that at least the first effect 
is negligible. Furthermore, summing data from 12 
observations have further smoothed out the first 
two effects. However, actual diffuse structure in 
the sky may come into effect at this very faint 
level. This aspect will be discussed in paper II. 
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